Technical field of the invention 

, m invention relates to data counnumcation and more particularly to a process for 

disseminating or distributing data through a network. 

Background art 

tas^-^-^*^***"^*"**?^^ 

access to a wide range of infarmarrorr, services mi dectromo documerrtaricsr. More and 
puMishera provide oraJira. electome dor*™**"", eoeks, «^ Ransom of eUc«»c»»dra 
15 files to customers connected to the network. 

As the size of the internet or a private intranet network increases, the need for fcstnW 
replication of an electronic file increases also. This capability is essential in a wide vanety of 
^tions. For instance, this is xe*nxed when W*^-— 
20 clusterisplannedandalso^ 

Web caches. When a new ver.cn of en operating system or other program * made avadable there 
. enormous need for quick replication of the software on a large number of different n»chme* 
which needs to be achieved without exce^ive stress on the web server and tire teleeonununicahon 



network. 
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Generally speaking, the transmission of a given file and its replication at different locations 
within a network is normally performed in a master/slave mode, mat is to say in a manner ^whrch 
concentrates the bnrden of the file copy on one central dilution point. In most situate, wnen the 
size of the electronic file to be transmitted is significant, and when there are a large number of 
^ients, this results in stress on bom the master device - eg. a web server - and also on the 
network. As an illustration, should one file of 500 Kbytes be provided to a number of 10O0 

that server and for the communication path. 

A technique known as JP multicast can in some circumstances be used to improve the 
situation but this, however, is not available on the public Internet network, because current* most 




routers do not \ 
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The use of a proxy can in some circumstances relieve both the publisher of the lectronic file 
and the network. As is well known, a proxy device functions tc avoid direct connection from one 
machine - generally within an Intranet network - to another machine being outside the Intranet 
5 network. The conimunication between the first and the second machine is achieved via the proxy 
which receives and forward all the requests on behalf of the former to the latter. 

Figure 1 illustrates the topology of a Intranet network which comprises a set of network 
devices - e.g. a router 9 allowing communication between a first subnetwork A and a second 
0 subnetwork B comprised within the Intranet, hut also the computers 6 and 7, a printer 8 ana. an 
intranet server 10. All the network devices of the Intranet communicate with the Internet network 3 
via a proxy element 5 allowing interface between the boundary of the Intranet and the Internet. It can 
therefore be seen that when a proxy is axranged within an Intranet network of a private organic*, 
for instance, all the requests from the machines of that Intranet network are first forwarded to the 
15 proxy, which accesses the remote machines and returns the results to the original requestor. Typically 
proxies are linked with other functionality, in particular me caching of web pages to reduce traffic so 
that one electronic file requested by a great number of machines belonging to one Intranet network is 
directly available without requiring multiple download operations from the publisher of the electronic 
files. Therefore, if one device - e.g. computer 6 requests one electronic file from an external web 
20. server 1 , the latter can be cached within Proxy 5 for later diffusion within the Intranet 

Summary of the invention 

This invention is directed to a process for distributing through a network an electronic 
25 message associated with a list of recipients, said process comprising the iterated application of the 
steps of: 

a) extracting from said list a first sublist and a second sublist of recipients; 

b) identifying a recipient within said first sublist which can be addressed via said network, and 

c) transmitting said electronic message and said first sublist to said identified recipient for onward 

30 distribution; and 

d) applying said steps (a) to (d) to said second sublist of recipients. 

The process can also comprise receiving the message and a list of recipients for onward 
distribution. The above described process can thus be carried out in a plurality of nodes of the 
35 network upon receipt at each node of the message and an associated first sublist generated in another 
of the nodes until the message has been transmitted to all recipients on the list 



HP50014107 



Tins process permits the burden of the transmission of the electronic document to be shared 
between different recipients, each of which reapplies the same process and reiterates using the 
original list of recipients until the electronic file or document is completely processed and transmitted 
to every recipient Preferably, the two sublets are arranged to have a similar siz and together 
5 comprising all the recipients listed in the list of recipients. 

In at least preferred embodiments, said message and the list of recipients are contained in a 
package the step of transmitting the electronic message to the identified recipient comprising 
generating a new package comprising the electronic message and the first list The package can be 
10 an XML document for instance, and can comprise information defining the size and the date of 
creation of said electronic message. 

In one embodiment, the transmission of the electronic package comprising the sublist is 
achieved via an HTTP channel, thus allowing easy transmission through the proxy of the Intranet 
15 networks. 

In one embodiment, the process is executed by an agent which is embodied by a 
computer program having its own installation procedure. Alternatively, the agent can be a specific 
component of an operating system allowing effective publishing functionality. 
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The invention provides a computer program product which permits distribution or flooding 
of the information throughout a telecommunication network, such as an Internet/intranet network for 



instance. 



25 Description of the drawings 



An embodiment of the invention will now be described, by way of example only, with 
reference to the accompanying drawings, wherein: 

30 Figure 1 illustrates a network including a a proxy system* 

Figure 2a and 2b respectively illustrate a computer system for performing a distribution 
process and a distribution control agent 25. 

Figure 3 illustrates the topology of five computers providing the distribution of an electronic 

file. 
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Figure 4 illustrates the process executed by a publisher. 
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Figure 5 illustrates the process executed by a control agent of anetwork device. 
Figure 6 illustrates the execution of the process on a topology of eight devices. 
Description of the preferred embodiment of the invention 

With reference to figure 2 there is shown a block diagram of a system which permits 
distribution of electronic files. The system is illustrated as residing within a computer system 20 
which may be a personal computer or any conventional suitably arranged computer system. The 
system is arranged to use existing electronic circuitry to perform the functions which will be 
described hereinafter. 

The computer 20 is illustrated as including a memory device 21 which may he a random 
access memory (RAM), Read-Only-Memory (ROM), a conventional mass storage, such as a hard 
disk drive or the like. The memory device 21 is conventionally capable of storing any type of date, 
hut is is particularly used in the present situation for achieving local replication of electronic files or 
electronic messages as will he explained hereinafter with details. In me following the term electronic 
messages will be used for simplicity to refer to the data that is to he disseminated within the network 
m whatever form and winch may be received and stored within the memory device 21. In typical 
applications this data may be for instance he files such as word processed or other documents, 
program files or media files (mp3, asx, pdf) files in accordance with known standard formats, but the 
application of the techniques described is not limited to any particular type of file, message or data. 
For instance, the process would be particularly useful for distributing media files such as elecHomc 
newspapers which need to be disseminated very rapidly. The mass storage may usefully be arranged 
to include a specific storage area (not shown) which is reserved for receiving the electronic files 
received and stored in accordance with the process which will be described with reference to figures 
4 and 5. 

The computer 20 is further illustrated as including processing circuitry 22 which generally 
governs the flow of software instructions and data within the computer system. In figure 2, the 
processing circuitry 22 is particularly arranged to provide read/write access to me memory device 21. 
Computer 20 further includes conventional input/output devices 23- 

Computer 20 further comprises communication circuitry 24. The communication circuitry 24 
may be, for instance, a modem allowing the computer 20 to communicate via a telecommunication 



network; such as an Internet network 3. It should be understood, however, that the type of 
communication circuitry with which compute 20 communicates with the Internet network - for 
instance to access a web server 1 with a database 2 - is not important In the case of an Internet 
connection, a TCP/IP communication layer is provided for supporting the communication layers 
5 which are involved in the communication steps of the process described herein. In most cases, the 
access to the Internet network is achieved via the World Wide Web via the well-known Hyper Text 
Transfer Protocol (HTTP). While HTTP is very useful, particularly because it is well adapted to the 
use of firewalls, any other kind of protocol allowing communication via die network can equally be 
used. 

10 

In addition to the conventional software components which are used for controlling the 
processing circuitry 22, and which generally includes the operating system and a set of application 
software, the computer 20 is provided with a distribution control agent 25 which consists of a set of 
processing instructions stored within the memory 2 1 and which are used for controlling the sequence 
15 of operations which will be described below. 

In this embodiment, the distribution control agent 25 is a specific software component 
provided with and embedded within the operating system installed within the machine. However, it 
will be understood that the functions described may be adapted to other configurations, such as in the 
20 form of a separately installed application program. 

hi this embodiment, distribution control agent 25 comprises two distinctive components: an 
emitter element 27 associated with a receiver element 26, as shown in figure 2b. The emitter element 
27 is arranged to communicate with corresponding receiver elements 26 of other computers which 
25 are accessible via the network 

With respect to figures 3 and 4 there will now be described the process for distributing a 
message. Computer 31 - including a distribution control agent 25 with one emitter (E) clement and 
one receiver (R) element - is assumed to transmit an electronic message to a set of recipients 

30 including recipient 32, 33, 34 and 35 shown in the figure 3. Recipients 32, 33, 34 and 35 are 
generally conventional computers such as illustrated in figure 2, but could also be more specific 
network devices allowing Internet access, such as mobile telephones, communicating wrist watches 
or similar appliaces, for instance. It is assumed that any device can communicate via any other 
device through the network, which is generally the case when each device is configured as a network 

35 node provided with an Interact Protocol (EP) address. 
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Figure 4 mare particularly shows the processing steps which are involved for achieving 
distribution of me messages. This process is launched by publishing server 3 1, for instance. 

la a step 41, distribution control agent 25 receives an electronic message to be distributed, 
5 fee latter being accompanied by a list of recipients (e.g. nodes 32-35 of figure 3), As mentioned 
above, the electronic message, can be received from an application program running within the 
computer. 

Upon reception of the message accompanied by the list of recipients, Agent 25 creates in a 
10 step 42 an electronic package containing the message to be distributed and the above mentioned list 
of recipients. In the case of a TCP/IP network, the recipients are defined by the IP addresses or by fee 
email addresses. Preferably, the electronic package which is created at step 41 is an XML file which 
complies wife fee standard extended Mark-up Language structure and which is associated with a 
Document Type Definition (DTD) file. The XML file preferably comprises additional information 
15 such as the date of creation of me electronic file. 

In a step 43, Agent 25 forwards fee XML package to its receive control clement 26 for 
further processing. 

20 The processing which follows after step 43 is fee general distribution processing which is 

depicted in figure 5 and which will now be described in detail. 

The process is launched when fee receive control element 26 of agent 25 receives in a step 51 
an XML package which is to be processed. 

Step 51 is then followed bv a step 52 where agent 25 reads fee container comprising the 
package and extracts the list of recipients therein included. If necessary, Agent 25 then removes 
itself from the list of recipients (not shown). 

30 In a sten 53. the agent 25 performs a test to determine whether the list of recipients is empty, 

in which case, fee process proceeds to a step 59 which is the completion of the process. 

However, in fee general case, fee list of recipients is not empty, and the process proceeds to a 
step 54 where agent 25 computes from the list of remaining recipients received in step 51 a set of two 
35 distinctive sublists: a first and a second sublist. First and second sublists share no common element 
and their joining permits fee original list of recipients which was included within the original 
package to be regenerated In one embodiment, fee two sublists are arranged to have substantially 
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the same size. More particularly, in me preferred embodiment, me first list has a number of elements 
which is set to be equal to the integer which is immediately superior to half the number of recipients 
of the original list With respect to the scheme of figure 3, it can be seen mat sublist 1 can include 
nodes 32 and 34, while sublist 2 includes nodes 33 and 35. In the particular case where the list is 
limited to a single recipient, (he process divides the list into a first and a second list, the second one 
being empty. 

In a step 55. agent 25 determines via transmit control element 27 one particular recipient 
within toe first sublist which can receive a notification via the network, for instance node 32 of 
sublist 1. Different methods and means can be arranged for achieving the above determination. 
Preferably, agent 25 issues a PING command to determine whether any particular recipient is 
available on the network. 



When the above determination completes, the agent 25 generates in a step ?6 a first 
15 subpackage comprising the original electronic document to distribute, accompanied with toe first 
sublist within an XML file which is formatted in a similar fashion to the original clectromc package 
which was created in step 41 of figure 4. In toe example of figure 3, it can be seen that first sublist 
includes nodes 32 and 34. While figure 5 clearly illustrates mat the determination of the particular 
recipient (step 55) precedes the generation of toe electronic package, it should be noted that toe 
generation of toe electronic package could be carried out before toe determination of toe appropriate 
recipient to which toe package will have to be transmitted. 



In a step 57. toe first subpackage is then forwarded to toe transmit control element 27 for 
further propagation through the network to toe recipient which was identified in step 54. This 

25 transmission is particularly illustrated in figure 3 by arrow 36 showing the propagation of the first 
subpackage from node 31 (toe publishing node) and node 32. Preferably, the HTTP protocol is used 
for this transmission to facilitate toe transmission through firewalls. Alternatively, this transmission 
can be achieved via the File Transfer Protocol or any other suitable protocol. Once toe recipient 
receives toe first subpackage, toe latter will correspondingly initiate toe general distribution process 

30 of figure 5. 

In a stop 58. agent 25 generates a second subpackage comprising toe original electronic 
document or message, accompanied by the second sublist, which is formatted within a XML file and 
forwarded to toe receive control element 26 of the same node and same agent. In toe example drawn 
35 in figure 3, it can be seen that toe second package has a second sublist including nodes 33 and 35. 
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The process then proceeds back to step 5 1 for the purpose of processing this new electronic 
package in accordance with the sequence of steps 51-58 which were described above. Thispermits a 
new pair of subliste to be computed, respectively a sublist containing node 35 and a second subhst 
containing node 33 permitting later transmission of the electronic file. Node 31 will therefore 
5 successively transmit the electronic document to node 35 (arrow 38) and then to node 33 (arrow 39). 
Node 34 will directly receive the electronic file from a direct transmission (arrow 37) from node 32. 

It should be noted that the process of figure 5 which was described in detail above, is 
executed in each agent of the different machines which are involved in the distribution process and 

LO which receive the package. Therefore, the process is successively executed in the different recipients 
until the full distribution of the original document or file is achieved. Further, it can be seen that the 
number of replications of foe original electronic documents is shared between the different recipients 
(ie nodes 3 1 and 32 in figure 3) of foe different consecutive sublists which are computed. Neither foe 
originating publishing server, nor foe network is disproportionately stressed or loaded by foe 

15 distribution of foe document, even if a greatnumber of recipients is to be reached. 

More particularly, it can be seen that foe number of successive downloads which is to be 
managed and handled by foe publishing server is decreased in a ratio of log2 with respect to the 
number of machines which are to receive foe original message. This can be seen with more clarity m 

20 figure 6 illustrating foe process for 7 nodes. Assuming that foe original list of recipients includes 
nodes 61 , 62, 63, 64. 65, 66 and 67, it can be seen that foe first sublist which is generated in step 54 
of figure's comprises nodes 62. 66 and 67 while the second subhst which is generated includes 61, 
63 64 and 65. When the first subpackage (containing &e first sublet) is transmitted to node 62, the 
originating node 61 only has to process a second subpackage comprising a second sublist which is 

25 reducedto contain nodes 61, 63, 64 and 65. 

At me next loop, when node 61 receives foe second subpackage, a new iterative proeeH is 
executed, what results in the computation of a new set of two subpackage*, respectively containing 
nodes 64-65 and nodes 63, and foe transmission of foe former for further processing to node 64. It 
can then be seen that node 6 1 no longer has to transmit foe electronic file to node 65 because node 65 
receives foe message via from the processing of the above mentioned subpackage by node 64. 

We foe invention was particularly described with reference with a computer, it should be 
clear that foe process may be adapted to any device which can communicate with foe network, and 
which can receive an electronic message. Such devices may include mobile telephones and more 
generally any device which can be fitted with a distribution control agent, such as agent 25, for the 
purpose of executing the steps of process illustrated in figure 4 and 5. 
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